METHOD FOR PRODUCING BIOLOGICALLY ACTIVE HUMAN 
FACTOR VIII IN THE MILK OF TRANSGENIC ANIMALS DRIVEN BY 
MAMMARY-SPECIFIC EXPRESSION CASSETTES 

5 BACKGROUND OF THE INVENTION 

1. Field of the invention 

The present invention discloses a method for producing biologically active 
recombinant proteins in the milk of transgenic animals, characterized in which 
10 intact human clotting factor VIII gene and B-domain deleted recombinant factor 
VIII gene are transferred into the mammal by gene microinjection and embryonic 
implantation to obtain expression and secretion in the milk of transgenic animals 
and their offsprings. 

2. Description of the prior art 

15 Hemophilia is the most common human bleeding disorder and affects 

approximately one in 5,000 males, causing lifelong, repeated, and potentially life- 
threatening hemorrhagic episodes. It results from a deficiency in functional blood 
coagulation factors, either in a serine protease called Factor IX (hemophilia B) or 
in its cofactor Factor VIII (hemophilia A). Because hemophilia is caused by 

20 defects in single X-linked genes that encode circulating plasma proteins, the 
development of therapeutic strategies has focused on finding ways to replace the 
defective or deficient proteins. 

Human FVIII is synthesized primarily in the liver and secreted into the 
circulation at low plasma concentration, approximately 100-200 ng/ml. Analysis 

25 of the cloned cDNA for human FVIII (Gitschier et al., 1984, Nature 312: 326-330; 



Toole et al., 1984, Nature 312: 342-347.) has provided more information about its 
gene and protein structure. The full length FVIII cDNA contains an open reading 
frame coding for a polypeptide of 2351 amino acids. This primary translation 
product contains a 19 a.a. signal peptides and six homologous domains in the 

5 order A1-A2-B-A3-C1-C2 (Vehar et al., 1984, Nature 312: 337-342.). The 256- 
kDa precursor protein is proteolytically processed intracellular^ to a metal ion- 
linked heterodimer of a 90- to 200-kDa heavy chain (A1-A2-B) and an 80-kDa 
light chain (A3-C1-C2), which circulates in the plasma bound to the von 
Willebrand factor (vWF). 

10 For the past two decades hemophiliacs have been treated with whole 

plasma, and more recently with highly purified FVIII concentrates prepared from 
plasma. Higher concentrates purity allows the administration of effective doses at 
lower volumes to patients. This therapy is effective in controlling bleeding 
episodes, however, hemophiliacs may suffer from other complications, 

15 compromised by the discovery that these products were transmitting human 
viruses such as the hepatitis B virus and human immuno-deficiency virus (HIV) to 
patients, many of them later developed diseases (Hoyer, 1993, Methods Enzymol. 
222: 169-176). 

Since then, plasma screening and viral inactivation procedures have greatly 
20 improved the safety of these products, although the potential for transmitting 
diseases such as Creutzfeldt- Jakob disease must be considered and has led to 
sporadic shortages in these plasma products. Nonetheless, when non-blood source 
products derived from recombinant DNA biotechnology became available, further 
reducing the viral risk, they were quickly embraced by caregivers and patients, 
25 despite their higher cost (Pipe and Kaufman, 2000, Nature Biotech. 18: 264-265). 
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One of the challenges created by the biotechnology revolution is the development 
of methods for the economical production of highly purified proteins in large 
scales. Recent developments indicate that manipulating milk composition using 
transgenesis has focused mainly on the mammary gland as a bioreactor to produce 
5 pharmaceuticals. 

SUMMARY OF THE INVENTION 

The ability to modify animal genomes through microinjection technology 
has offered new alternatives for the manufacture of recombinant proteins. 

10 Targeting the production of human recombinant protein pharmaceuticals in the 
milk of transgenic animals solved many problems associated with either microbial 
or animal cell expression systems. Bacteria often improperly fold complex 
proteins, introducing more involved and expensive processes. Both bacteria and 
yeast lack adequate post-translational modification. Bioreactors for cell cultures 

15 require high capital expenditures, use large volumes of expensive culture media, 
and often suffer from relatively low yields. 

To express a recombinant protein in the milk of a transgenic animal, 
expression vectors containing a gene encoding the protein of interest fused to milk 
specific regulatory promoter elements are generally introduced by microinjection 

20 into a pronuclear-stage embryo, or alternatively transfecting the expression vector 
into a cell line suitable for somatic cell nuclear transfer. Most of this work has 
been carried out with: the ovine -lactoglobulin, rodent (mouse, rat and rabbit) 
whey acid protein (WAP) genes, bovine a -lactalbumin and a-sl -casein genes, as 
well as the caprine yS -gene. 

25 Several blood proteins, hormones, and enzymes have been synthesized in 
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transgenic animals. Of these, human FVIII is probably the largest and most 
complex protein to be expressed. Analysis of the cloned cDNA for human FVIII 
has provided more information about its gene and protein structure. The full 
length FVIII cDNA contains an open reading frame coding for a large polypeptide 
5 of 2351 amino acids. The 256-kDa precursor hFVIII protein is proteolytically 
processed intracellularly to a metal ion-linked hetero-dimer of a 90- to 200-kDa 
heavy chain (A1-A2-B) and an 80-kDa light chain (A3-C1-C2), which circulates 
in the plasma bound to the von Willebrand factor (vWF). The B-domain, is 
encoded by a single large exon and is highly glycosylated, harboring 19 of the 25 

10 N-linked glycosylation sites. The B-domain is released upon co-factor activation, 
so it is not necessary for clotting function. 

According to the regulatory sequences employed, variable levels of 
expression for the recombinant human FVIII were observed. For example, using a 
construct containing 2.2-kb ovine /3 -lactoglobulin 5'-flanking sequences to drive 

15 the FVIII cDNA with the introns of the murine metallothionein I ( J3 -Lac/hFVIII- 
Mtl), only extremely low level (4-6 ng/ml) of expression was observed in 
transgenic sheep (Niemann et aL, 1999. Transgenic Res 8: 237-247). Whereas the 
hFVIII cDNA, under the control of the 2.5-kb mouse WAP promoter, was shown 
to secrete up to 2.7 ji g/ml in transgenic pig milk (Paleyanda et al., 1997. Nature 

20 Biotech 15: 971-975). 

In this invention, we describe a novel recombination of promoter, leader 
sequence, coding sequences and poly A tail signaling sequence as a expression 
cassette to generate of transgenic animals expressing the fusion gene in the 
mammary gland and the presence of recombinant FVIII protein ranged from 7 to 

25 50 fi g/ml with over 35- to 200-fold higher than in normal human plasma, with 
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temporal and spatial expression profiles for clotting active rFVIII in the milk of 
transgenic animals in different stages of lactation. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

Fig.lA~lD shows a schematic map of full-length hFVIII transgene 
construction and detection of transgene integrated patterns in transgenic animals. 

Fig.lA The structure of aLA-hFVIII-bGHpA fusion genes are shown and a 
pair of primer sets for PCR screening and a 1.8-kb hFVIII kpnl-fragment as probe 
10 were designed franking the 5 '-end and 3 -end of hFVIII cDNA, respectively. 

Fig. IB Rapid screening positive transgenic mice founders by PCR 
amplification. NC represented a normal mouse DNA as a negative control. 

Fig.lC Screening of positive transgenic goat founders and Fl offspring by 
PCR amplification. NC represented a normal non-transgenic goat DNA as a 
1 5 negative control . 

Fig. ID Southern blot analysis of the integration patterns of transgene DNA 
in transgenic animals. Genomic DNA extracted from different founders of 
transgenic mice tails or transgenic goats' ear tissues were digested with BamHI 
and subjected to probe hybridization. 
20 Fig.2A~2B represents the determination of hFVIII transgene copy numbers 

in 17 transgenic founders using slot-blot hybridization. 

Fig.2A The relative positions of sample loading for standard controls and 
transgenic founders. The standards were made with calculated hFVIII plasmid 
DNA, 0.5 to 20 copy numbers, mixed with 10 ug of normal mouse genomic DNA. 
25 The lanes of transgenic founders were also slotted with 10 ug of undigested 
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genomic DNA. A kpnl fragment of hFVIII cDNA was used as a hybridization 
probe. Hybridization and wash was performed in high stringency condition (70°C) 
to avoid cross-hybridization. Result is shown in Fig.2B. 

Fig.3A~3C provides a schematic map of B-domain deleted hFVIII 
5 transgene construction and detection of transgene junctional sequences. 

Fig.3A The structure of a LA-CN-hFVIII(AB)-bGHpA fusion genes are 
shown and two pairs of primer sets for PCR screening and a 1.8-kb hFVIII kpnl- 
fragment as probe were designed franking the 5'-end, B-domain flanking region 
and 3'-end of hFVIII cDNA, respectively. 
10 Fig.3B DNA sequencing confirmed the in-frame creation of the N-terminal 

a SI -casein signal peptide sequence and the 19-aa leader pepteide-removed 
hFVIII sequence. 

Fig.3C DNA sequencing confirmed the in-frame creation of B-domain 
deletion between hFVIII A-domain (A1-A2) and hFVIII C-domain (A3-C1-C2). 
15 Fig.4A~4C shows the analyzed data of B domain-deleted hFVIII transgenic 

animals. 

Fig.4A The structure of a LA-CN-hFVIII( AB)-bGHpA fusion genes are 
shown and two pairs of primer sets for PCR screening, 273-bp for the 5'-end 
franking region and 751 -bp for the B-domain flanking region, respectively. It also 
20 shows the genomic digested enzymes and predicted patterns in Southern blot 
assay. 

Fig.4B Rapid screening positive transgenic founders by PCR amplification. 
NC represented a normal mouse DNA as a negative control. PC represented a 
hFVm( AB) plasmid DNA as a positive control. 
25 Fig.4C Southern blot hybridization analysis of transgenic goat (Tg-3431) 
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harboring B-domain-deleted hFVIII fusion gene. Normal and transgenic goat 
genomic DNAs were digested with Hindlll, PvuII, and Xbal, individually. 
Arrowheads represented the off-size bands in transgenic goat genome. 

Fig.5 confirms the germ-line transmission of transgenic animals carrying 
5 hFVIII(AB) exogenic DNA in their Fl generation by using PCR detection and 
direct PCR product sequencing. The sequencing result exhibited the intact 
transgenic hFVIII(AB) fragment was acturally present in their Fl offspring's 
genome. 

Fig.6A~6B shows the mammary-specific expression cassette regulated a 
10 temporal and spatial expression of exogenous hFVIII RNA in transgenic animals. 

Fig.6A Tissue-specific expression of a LA-FVIII transgene by RT-PCR 
detection. Tissue was removed from lactating mice at 7 days post partum. RNA 
was isolated from lactating transgenic mouse mammary gland (Ma), heart (H), 
liver (L), lung (Lu), muscle (M), brain (B), prancreas (P), and in male transgenic 
15 mouse mammary fat pad (F). A b-actin primer set was used as internal control. 

Fig.6B Stage-specific expression of a LA-FVIII transgene during pre- 
partum (D-3), lactation (Dl to D22), and weaning (D29 to D36). A 
glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) primer set was used as an 
internal control. The ratios of hFVIII/ GAPDH band intensities were plotted by 
20 densitometer with computer analysis (Bio-Rad video densitomer, model 620). 

Fig.7A~7B shows the Western blot analysis of recombinant hFVIII protein 
secreted from mammary glands of transgenic mice at different lactating stage. 

Fig. 7 A The secretion of rFVIII protein in milks was collected, 10- to 20- 
fold diluted and subjected to western blotting with rabbit anti-hFVIII polyclonal 
25 antiserum. Lane 1 contained the milk protein harvested from non-transgenic mice 
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during lactating stage as a negative control and lane 2 contained HPLC-purified 
hFVIII heavy chain (200-kDa) as a positive control. The orders for D7, D12, D17, 
and D22 represented the milk proteins harvested from various lactating stages of 
transgenic mice. 

5 Fig.7B Western blotting analysis of rFVIII light chain protein with mouse 

anti-hFVIII monoclonal antibody D2. Lane 1 contained the milk protein harvested 
from non-transgenic mice during lactating stage as a negative control and lane 2 
contained the HPLC-purified hFVIII light chain (80-kDa) as a positive control. 

Fig. 8 shows the Western blot analysis of recombinant hFVIII protein 
10 secreted from mammary glands of transgenic goat (Tg-201) at early lactating 
stage. The secretion of rFVIII protein in milks was collected in days 3, 1 1 and 22. 
The collected milks were diluted in 1- to 3 -fold and subjected to Western blotting 
and hybridization with rabbit anti-hFVIII polyclonal antiserum. Lane 1 contained 
HPLC-purified hFVIII heavy chain (200-kDa) as a positive control and lane 2 
15 contained the milk protein harvested from non-transgenic mice during lactating 
stage as a negative control. 

Fig.9A~9C shows the functional assay of transgenic hFVIII protein by 
clotting activity analysis. The coagulant activity of hFVIII was measured by 
reduction of the activated partial thromboplastin time (aPTT) in a one-stage assay 
20 (Over, 1984, Scand J. Haematol. 41 : 13-24.). 

Fig.9A Human FVIII dilutions equivalent to a 1% to 50% dilution of 
normal plasma (NPP) were prepared in 50 mM imidazole buffer. 

Fig.9B NPP standard curve was performed using aPTT clotting assay. 
Fig.9C Control and transgenic goat milks diluted in 50%, 20%, and 10% 
25 were incubated in duplicate at 37°C in hFVIII-deficient plasma, followed by the 



automated addition of aPTT reagent. The clot time, in seconds, was recorded on 
an ST2 Coagulometer (Stago). 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

The invention relates to transgenic milk that can be used to produce 
therapeutical recombinant hFVHI for hemophilia patent treatment. 

Using mammary gland-specific promoters, a wide range of proteins of bio- 
pharmaceutical interest have been expressed in rodents, pigs, and dairy animals 

10 (Echeland, 1996, Current Opinion in Biotechnology 7: 536; Houdebine et al., 
2000, Transgenic Research 9:505). An expression vector, comprising a gene 
encoding the target protein of interest fused to a milk promoter gene, is introduced 
by microinjection into the pronucleus of a one-cell embryo. Upon germ line 
integration and expression, the transgene becomes a dominant Mendelian genetic 

15 characteristic that is inherited by the progeny of the founder animal. Mammalian 
mammary epithelial cells have the capacity to carry out complex protein synthesis 
with a variety of posttranslational modifications and folding. The milk of 
transgenic livestock presents a starting material from which human diagnostic or 
pharmaceutically-active and therapeutic proteins may be purified using 

20 established technologies. 
Milk Specific Promoters 

A variety of transcriptional promoters that preferentially activate 
transcription in mammary epithelial cells are available. These include the 
promoters that control the genes encoding milk proteins such as caseins (a S1-, 

25 a S2-, p-, y-, and k -casein), beta-lactoglobulin (Clark et al., 1989, 

9 



Bio/Technology 7: 487-492), whey acid protein (Gordon et al., 1987, 
Bio/Technology 5: 1 183-1 187), and alpha-lactalbumin (Soulier et al., 1992, FEBS 
Letters 297: 13). Casein promoters may be derived from the alpha, beta, gamma 
or kappa casein genes of any mammalian species; a preferred promoter is derived 
5 from the goat beta casein gene (DiTullio, 1992, Bio/Technology 10:74-77). 

The DNA sequences of many these promoters are available, e.g., in 
GenBank and in scientific publications such as 1) rat alpha-lactalbumin (Richards 
et al., 1981, J. Biol. Chem. 256: 526-532); 2) rat WAP (Campbell et al., 1984, 
Nucleic Acids Res. 12: 8685-8697); 3) rat alpha-casein (Jones et al., 1985, J. Biol. 

10 Chem. 260: 7042-7050); 4) rat alpha-casein (Lee and Rosen, 1983, J. Biol. Chem. 
258: 10794-10804); 5) human alpha-lactalbumin (Hall, 1987, Biochem. J. 242: 
735-742); 6) bovine alpha-Si casein (Stewart, 1984, Nucleic Acids Res. 12: 389); 
7) bovine alpha-casein (Gorodetsky et al., 1988, Gene 66: 87-96); 8) bovine 
alpha-casein (Alexander et al., 1988, Eur. J. Biochem. 178: 395-401); 9) bovine 

15 alpha-S2 casein (Brignon et al., 1977, FEBS Letters 188: 48-55); 10) bovine 
alpha-lactoglobulin (Jamieson et al., 1987, Gene 61: 85-90; Alexander et al., 1989, 
Nucleic Acids Res. 17: 6739). For additional regulatory control or stringency, 
other regulatory sequences such as adjacent sequences, can be obtained from these 
genes or from homologous genes of other mammals using these promoter 

20 sequence as probes to screen genomic libraries. 
Signal Peptide Sequences 

The recombinant gene construction can also include a signal sequences, 
particularly a signal peptide sequences of a milk specific gene. For the best 
secretion efficiency of forigen protein in the mammary gland, the milk-specific 

25 signal peptide sequence can be the secretional signal sequence which naturally 

10 



occurs with the selected milk-specific promoter used in the construct, which are 
described below. For example, signal sequences from genes coding for caseins, 
e.g., alpha, beta, gamma or kappa caseins, beta lactoglobulin, whey acid protein, 
and lactalbumin can be used. 
5 Transgenesis Constructions 

A 2.0-kb promoter sequence of bovine alpha-lactalbumin ( a LA) was 
generated by PCR amplification using a genomic DNA from high milk-producing 
Holstein cow, which obtained from National Taiwan University Farm, as the 
template. This PCR product containing entire a LA promoter and 19-aa leader 

10 sequence (SEQ ID: No. 1) or replacment with bovine aSl-casein 15-aa signal 
peptide sequence (SEQ ID: No. 2) which created a restriction enzyme, Hpal, 
cloning site in the downstream sequence. These two types of promoter and signal 
peptide sequences were then subsequently inserted into the pCR3 vector 
(Invitrogene, San Diego, CA). 

15 For the full-length hFVIII construction, the resulting 7.0-kb plasmid 

containing a LA promoter and its intact signal peptide sequence was double 
digested with Mlul and PstI and treated with calf intestinal phosphatase 
(Boehringer Mannheim, Indianapolis, IN). The pCMV5/hFVIII plasmid 
containing the intact human FVIII coding sequence was also double digested with 

20 Mlul and PstI. The in frame sequence from the a LA leader peptide through the 
hFVIII junction was determined using the Dye Terminator sequencing system 
(Applied Biosystems Inc., Foster, CA). The 9.7-kb transgene consisting of 2.0-kb 
bovine a LA promoter, 7.2-kb hFVIII cDNA, and 0.5-kb bovine GH gene 
polyadenosine signal sequence was separated from plasmid pCR- a LA/hFVIII-5 

25 using Clal and Xbal digestion and purified for microinjection on the twice CsCl 2 
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gradient ultra-centrifugation. 

For the B domain-deleted hFVIII construction, the resulting 7.0-kb plasmid 
containing a LA promoter and a SI -casein signal peptide sequence was double 
digested with Hpal and Xhol and treated with calf intestinal phosphatase 
5 (Boehringer Mannheim, Indianapolis, IN). The pCMV5/hFVIII plasmid 
containing the intact human FVIII coding sequence was used as template to 
generate a 2233-bp rhFVIII A-domain fragment using degenerate PCR, phFVIII- 
Hpal(+): 5'-GGT TAA CTG CCA CCA GAA GAT A-3' (SEQ ID: NO.3) and 
phFVIII-741aa(-): 5'-AAG CTT CTT GGT TCA ATG GC-3' (SEQ ID: NO.4), 

10 and a 2085-bp rhFVIII C-domain fragment amplified by phFVIIM643aa(+): 5'- 
AAG CTT GAA ACG CCA TCA ACG GGA A-3' (SEQ ID: NO.5) and phFVIII- 
Xhol(-): 5'-CTC GAG CCT CAG TAG AGG TCC TGT-3' (SEQ ID: NO.6), 
respectively. Equal molar ratio of A-domain segment, C-domain segment and 
pCR3-aLA vector were co-ligated and transformed into host competent cells. 

15 The in frame sequence from the a SI -casein leader peptide through the hFVIII 
junction was determined using the Dye Terminator sequencing system (Applied 
Biosystems Inc., Foster, CA). The 6.8-kb transgene consisting of 2. 0-kb bovine 
a LA promoter and aSl-casein leader sequence, 4.3-kb hFVIII cDNA, and 0.5-kb 
bovine GH gene polyadenosine signal sequence was separated from plasmid pCR- 

20 a LA/hFVIII(AB) using Clal and Xbal digestion and purified for microinjection 
on the twice CsCl 2 gradient ultra-centrifugation. 
Transgenic Animal Productions 

The purified transgene was microinjected into the male pronuclei of 
fertilized eggs from superovulated female mice of the outbreed ICR strain and 

25 transferred to recipient pseudo-pregnant females as previously described (Chen et 



al., Transgenic Res 4: 52-59; 1995). For transgenic goat production, the 
pronuclear stage embryos were flushed from the donor goat's oviduct at the one 
and half day after insemination by means of surgical method. In order to obtain 
higher embryo numbers, every embryo-donating goat had been treated with 
5 endocrine so as to achieve the object of superovulation. Such a superovulation 
treatment comprised of administrating intramuscular of follicular stimulating 
hormone (FSH) to the embryo-donating goat sequentially for 4 days since eighth 
day of estrous cycle twice a day with interval of 12 hours and the dosage was 
decreased daily as 4-, 3-, 2-, and 1-mg, respectively. As the first dosage at the 

10 third day, co-administrated with lOOOiu human chorionic gonadotropin (HCG) 
which resulted in detection of estrous after 54 hours whereupon gave two artificial 
insemination (AI) with a interval of 12 hours. Then, at about one and half days 
after conception, the goat one-cell stage embryos were collected with a sterile 
glass capillary tube via surgical embryo flushing method. The collected embryo 

15 was transferred into another petri dish where it was rinsed more than ten times. 
Thereafter, it was placed under a phase contrast microscope at 400X amplification 
for gene microinjection manipulation. After a transient in vitro culture, the 
healthly microinjected embryos were then transferred into recipient oviducts for 
further conceptus development. 

20 Recombinant hFVIII Transgene Determination 

The resulting pups were rapidly screened for the transgene by PCR 
amplification of tail or ear tissue DNA. PCR was performed using one set of 
primer, pa LA-124(+): 5'-CTC TCT TGT CAT CCT CTT CC-3' (SEQ ID: NO.7) 
and phFVIIM49(-): 5'-GGT TAC GCG TCA AGA TTC TGA-3' (SEQ ID: NO.8), 

25 which defined a 273-bp region spanning the a LA promoter, secretion signal 



sequence and hFVIII cDNA junctional sequence. For the B domain-deleted 
hFVIII transgene detection, additional primer pairs was designed, phFVIII- 
ACJ(+) : 5'-AGA CTT TCG GAA CAG AGG CA-3* (SEQ ID: NO.9) and 
phFVIII-ACJ(-): 5'-ATC TTT TTC CAG GTC AAC ATC A-3' (SEQ ID: NO. 10), 

5 which defined a 751 -bp region flanking A-C recombinant junction. The positive 
PGR screening results for transgenic animals were further confirmed by Southern 
blot analysis. Ten micrograms of genomic DNA were individurally digested with 
BamHI, Hindlll, PvuII, and Xbal restriction enzymes at 37 °C overnight, 
electrophoresed on a 0.8% agarose gel, and transferred to a Durose membrane 

10 (Stratagene, La Jolla, CA). A Kpnl fragment of hFVIII-specific cDNA (1.8-kb) 
was used as a radioactive probe to hybridize the membrane. Blots were subjected 
to autoradiography for three days at -20°C . 

Recombinant hFVIII RNA Expression in Transgenic Animals 

The temporal and spatial expression of hFVIII RNA in transgenic animals 
15 was analyzed using a reverse transcription-polymerase chain reaction (RT-PCR). 
Total RNA from different tissues including the heart, liver, lung, muscle, 
mammary gland, brain, pancrease, and kidney of female transgenic mice during 
lactation periods (Day 1 to Day 28 post partum) was extracted using the acid 
guanidinium thiocyanate method (Chomczynski and Sacchi, 1987, Anal. Biochem. 
20 162: 156-159). One microgram of total RNA was treated twice with 10 units of 
DNase I (Gibco BRL, Gaithersburg, MD) and phenol-chloroform extracted. RNA 
pellets were resuspended in 1 5 ft I DEPC-water and then used to synthesize the 
first-strand cDNA with random primers and Superscript reverse transcriptase 
(Gibco BRL, Gaithersburg, MD) in a total volume of 25 ml. The reaction was 
25 carried out at 42°C for 1 hr. 
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For further PCR amplification, an aliquot (1/10) of the RT product was 
adjusted to contain 0.1 fi g of each primer and additional buffer was added for a 
total volume of 50 fi L PCR was performed for 30 cycles (94°C , 1 min; 55 °C , 2 
min; 72°C , 2 min). The primers used included a pair of hFVIII-specific primers, 
phFVIII-F2(+):5 , -CAT TCT ATT CAT TTC AGT GGA CA-3' (SEQ ID: NO.l 1) 
and phFVIII-R2(-): 5'-GAG ATG TAG AGG CTG GAG AAC T-3' (SEQ ID: 
NO. 12), and a pair of glyceraldehydes-3 -phosphate dehydrogenase (GAPDH), as 
well as a pair of £ -actin. Both # -actin and GAPDH are the universal transcripts 
in every cell and were used as the internal controls of RT-PCR. 
I mmunoblot Analysis of Transgenic Milk Proteins 

Milk was collected from lactating females as previously described (Simons 
et al., 1987, Nature 328: 530-532.) and analyzed using SDS-polyacrylamide gel 
electrophoresis (SDS-PAGE) (Cheng et al., 1998, Human Gene Therapy 9: 1995- 
2003). Ten- to twenty-fold dilutions of the collected milk, in 75 mM Tris-HCl 
buffer at pH 6.8, from different lactation periods were diluted in SDS-PAGE 
sample buffer with 5% 2-mercaptoethanol and electrophoresed on 7.5% gel. To 
estimate production levels, HPLC-purified recombinant hFVIII standard was 
diluted to 10 //g/ml in normal mouse milk and electrophoresed alongside the 
transgenic milk samples. Proteins were electrotransferred from the gel to a PVDF 
membrane (NEN Life Science Products, Boston, MA). The blots were probed 
with primary antibodies recognizing hFVIII at 2 to 10 //g/ml and washed with 
phosphate-buffered saline containing 0.1% Tween-20 (PBS-T). Blots were reacted 
with horseradish peroxidase (HRP)-conjugated second antibodies at 0.2 mg/ml. 
Polyclone antibody C6 recognize the heavy chain of hFVIII rang from 80-200 
kDa and monoclone antibody D2 recognize the light chain of hFVIII at 75-80 kDa. 
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The blot was then developed with the chemiluminescent ECLTM detection 
system (Amersham, UK) and exposure to x-ray film. Band intensities were 
compared by densitometry. 

Secretion of hFVIII protein from the transgenic milk was determined 
5 quantitatively with the enzyme-linked immunosorbent assay (ELISA) as 
previously described (Chen et al. 5 1993, J Virol 67: 2142-2148). Briefly, plates 
were coated with capture antibodies (ESH-5 and ESH-8, 500ng each per well; 
American Diagnostics, Greenwich, CT) in carbonate buffer and incubated for 1 
hour at 37 °C. Plates were washed with 0.05% Tween-20 in phosphate-buffered 

10 saline (PBS) and blocked in 50 mM Tris (pH 7.2), 150 mM NaCl, 0.5% gelatin, 
and 0.05% Tween-20 for 2 hour at 37 °C. Purified recombinant FVIII (Hyland, 
Baxter Healthcare, CA), prepared in blocking buffer with 1:50 normal murine 
milk, served as the standard. Samples and standards were incubated for 1 hour at 
37°C. Detection antibody (rabbit anti-human FVIII 1: 10,000 dilution) was added 

15 and allowed to react with hFVIII at 37°C for 1 h. After removing the first 
antibody using four washes with PBS, a second antibody, goat anti-rabbit 
immunoglobulin antibody conjugated with horseradish peroxidase, was added (1: 
3,000 dilution) for a further 1 h of incubation. The plates were washed again 
thoroughly with PBS, and 100 fi 1 of substrate solution (2 mg of 0- 

20 phenylenediamine dissolve in 1 ml of 1M phosphate citrate plus 0.02% H 2 0 2 ) 
was added for development. After 30 min, 100 /a I of 1M H 2 SO A was added 
immediately to stop the reaction and the colors were measured by optical density 
at 492 nm. 

The Feature of a -Lactalbuimn Mammary Expression Cassette 

25 Expression of rFVIII protein in the milk of transgenic animals driven by 
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bovine a -lactalbuimn regulatory sequence seems more efficient than the other 
transgene constructions, which were controlled by ovine -lactoglobulin 
promoter (Niemann et ah, 1999, Transgenic Res 8: 237-247.) as well as mouse 
WAP regulatory sequence (Paleyanda et aL, 1997, Nature Biotech 15: 971-975). 
5 Expression of the bovine a LA gene has been shown to be the most lactation- 
specific of all bovine milk protein genes (Goodman and Schanbacher, 1991, 
Biochem Biophys Res Commun 180:75-84). Use of the a LA 19-aa secretary 
peptide (SEQ ID: NO: 13) or a Sl-casein 15-aa signal peptide (SEQ ID: NO: 14) 
to lead the rFVIII protein secretion and the bGH polyadenosylation signal to 

10 stabilize the steady-state of hFVIII RNA molecules, which may also have 
contributed to our success. Actually, there are several previous reports 
demonstrated that mammary specific transgenes driven by a LA promoter have 
resulted in high levels of gene expression up to 3.7 mg/ml (Hochi et al., 1992, 
Mol Reprod Dev 33: 160-164; Soulier et al., 1992, Fed Eur Biol Soc Lett 297: 13- 

15 20; Maschio et al., 1991, Biochem J 275:459-465) in their milks. This 
characteristic of the bovine a LA gene makes its regulatory elements potentially 
useful as a mammary expression system in transgenic animals. In contrast to the 
caseins and b-lactoglobulin, the production of a LA mRNA increases suddenly at 
paturition, remains elevated during lactation, and drops sharply at dry-off and 

20 during involution. The rFVIII protein profile lead by the a LA promoter and 19- 
aa secretion signal peptide or a Sl-casein 15-aa signal peptide follows a similar 
trend. The lactation specificity of the regulatory regions used to control mammary 
expression in transgenic animals may be important when foreign proteins that 
have biological activities are expressed because these proteins may exhibit their 

25 biological function in the animals if they are secreted before tight junctions of 
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mammary epithelial cells are formed. 

The Feature of hFVIII Protein Expression 

Clotting factor IX (FIX), adenosine deaminase (ADA), and other cDNAs 
were expressed at high levels from retroviral vectors in rat fibroblasts (Miller, 
5 1992, Nature 357: 455-460; Palmer et al., 1991, Proc Natl Acad Sci USA 88: 
1330-13334), whereas the FVIII cDNA was expressed at very low levels from 
these vectors in primary human fibroblast cells (Hoeben et al., 1990, J Biol Chem 
265: 7318-7323; Lynch et al., 1993, Hum Gene Ther 4: 259-272). It has been 
found that FVIII RNA steady-state levels were reduced 100-fold from a FVIII 

10 retroviral vector compared to the same vector expressing other cDNAs, and FVIII 
vector titers were correspondingly reduced 100-fold compared to other vectors 
(Lynch et -al., 1993, Hum Gene Ther 4: 259-272). A large part of the inhibitory 
effect of the FVIII cDNA on expression from a retroviral vector has been 
localized to a 1.2-kb fragment, derived from the A2 and A3 domains, which 

15 decreased the steady-state RNA levels 100- to 200-fold and decreased vector titers 
10-fold (Koeberl et al., 1995, Hum Gene Ther 6: 469-479). The mechanism by 
which the FVIII cDNA inhibitory sequence (INS) decreases RNA and protein 
expression from expression vectors remain to be established. Recently, 
experimental data provided ample evidence that the human FVIII cDNA contains 

20 elements that repress its own expression at the level of transcription (Hoeben et al., 
1995, Blood 85: 2447-2454). Fallaux and coworkers (1996, Mol Cell Biol 16: 
4264-4272) identified a 305-bp region derived from exons 9 to 1 1 that encodes a 
nuclear-matrix attachment region (MAR), also called the scaffold-attached region 
(SAR). Yeast MARs elements can play an important role in transcriptional 

25 silencing (Newlon and Theis, 1993, Transgenic Res 8: 237-247). The B domain 



can be deleted without any significant effect on specific procoagulant activity 
(Pittman et al., 1993, Blood 81:2925-2935; Chao et al., 2000, Blood, 95: 1594- 
1599). B-domain-deleted hFVIII cDNA (BDD-hFVIII) is 4.3 kb and primary 
truncate polypeptide (hFVIIIAB) is 1448 aa (SEQ ID: NO.15) not thought 
5 feasible for testing in transgenic animals previously. 

The present invention is further depicted in the illustration of examples, but 
the descriptions made in the examples should not be construed as a limitation on 
the actual application of the present invention. 

10 EXAMPLE 1 

Generations of transgenic mice, pigs and goats carrying the full-length 
hFVIII gene in mammary-specific expression cassette 

Out of 79 potential transgenic founder mice, 17 were identified as being 
transgenic by the PCR screening (Fig. IB). Two transgenic founder goats carrying 

15 the full-length hFIII also have been identified by PCR screening (Fig. 1C) as well 
as two lines of transgenic pigs have been established. To quantify the transgene 
copy numbers and to understand the integration patterns of the foreign gene 
within the genome the transgenic animals, restriction enzyme BamHI was used to 
digest the genomic DNA, cut once within the transgene, and subjected to Southern 

20 blotting analysis. On hybridization using the 1.8-kb Kpnl fragment of human- 
specific FVIII sequence as a probe (Fig. 1A), a 9.7-kb band representing 
monomelic transgene copies present in head-to-tail joining (H-T) tandem repeats 
was observed in all cases. A 4.6-kb hybridization band was observed in the blots 
of all transgenic mice present in tail-to-tail joining (T-T) inverted repeats (Fig. 

25 ID). Different intensities of the hybridization signal were also observed indicating 

19 
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the presence of different transgene copy numbers in these mice. Further detail 
quantification using slot-blot hybridization (Fig. 2A-2B), they could be 
distinguished into three groups according to the copies of transgene integration. 
For low copy number (defined as 1-5 copies per cell), there were six transgenic 

5 lines including a LAF-7, -15, -33, -36, -43 and -54. Transgenic founders carrying 
middle copy number (10-20 copies per cell) were found in nine lines including a 
LAF-9, -13, -18, -27, -28, -29, -30, -38 and -39. In addition, there were two lines 
( a LAF-8 and -25) carrying high copy number (40-50 copies per cell) of hFVIII 
transgene in their genome. All of the transgenic mice harboring aLA-hFVIII 

10 DNAs presented two-to-three off-size bands (Fig. ID) besides the predicted 
transgene junctions. These bands were probably rearranged transgene sequences, 
some of which were integration sites representing transgene-cellular junctions. 

Breeding lines were established from all transgenic founder animals by 
crossing them with normal ICR mice, Alpine goats (Fig. 1C), or hybride pigs. The 

15 multiple integrated copies of the transgene were found to be stably germ-line 
transmitted among twelve (7 females and 5 males) out of the 17 transgenic 
founders. 

EXAMPLE 2 

20 Generations of transgenic mice and goats carrying the B-domain deleted 
hFVIII gene in mammary-specific expression cassette 

B-domain deleted hFVIII transgene construct driven by a LA promoter 
and a SI -casein secretary peptide sequence was shown in Fig. 3 A. In N-terminal 
of rhFVIII polypeptide, a 15-aa a SI -casein signal peptide was added for leading 

25 newly synthesized rhFVIII secretion into mammary ductal cavity (Fig. 3B). In the 
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middle of rFVIII polypeptide, the A-domain (741 -residue) has been created to join 
the C-domain (1643 -residue) for completely deleted the B-domain segment (Fig. 
3C). 

Out of 65 potential transgenic founder mice, 15 were identified as being 
transgenic by the two sets of primers, one pair of primer located in 5 f -promoter 
junction, the other pair of primer flanking the B-domain deletion region (Fig. 4A). 
The PCR screening was shown in Fig. 4B. To quantify the transgene copy 
numbers and to understand the integration patterns of the foreign gene within the 
genome of the transgenic mice, restriction enzyme Hindlll, PvuII, and Xbal were 
used to digest the genomic DNA, cut once within the transgene, and subjected to 
Southern blotting analysis (Fig. 4C). The result showed that transgenic goat (Tg- 
3431) harboring B-domain-deleted hFVIII fusion gene clearly exhibited one to 
two off-size bands in their genome when compairing with normal goat genome. 

Germ-line transmission ability of transgenic animals carrying hFVIII(AB) 
exogenic DNA in their Fl generation was confirmed by using PCR detection and 
direct PCR product sequencing. The sequencing result exhibited the transgenic 
hFVIII(AB) fragment was acturally present in their Fl offspring's genome of 
transgenic goats (Fig. 5). 

EXAMPLE 3 

Temporal and spatial expression of exogenous hFVIII RNA in transgenic 
animals driven by mammary-specific expression cassette 

To characterize the spatial expression of the new developed mammary- 
specific expression cassette containing hFVIII in this invention, total tissue RNAs 
were extracted from different organs in the breeding lines established from the 
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transgenic founder mice. Reverse-transcriptional polymerase chain reaction (RT- 
PCR) analysis was performed using two pair of primer sets, one was phFVIII- 
F2(+) (SEQ ID: NO.ll) and phFVIII-R2(-) (SEQ ID: NO.12) specific to human 
FVIII cDNA franking 3 '-coding region, the other was b-actin primer set as an 
internal control. As shown in Fig. 6 A, the transgene expressed transcriptant of the 
410-bp hFVIII RT-PCR product was found in the mammary gland of lactating 
transgenic mouse. No homologous transcripts were detectable in the heart (H), 
liver (L), lung (LU), muscle (M), brain (B), prancreas (P), or in male transgenic 
mouse mammary fat pads (F). All examined tissues showed a 420-bp amplified 
fragment for mouse b-actin to assess the integrity and quantity of RNA in each 
sample. The efficiency of DNase I treatment to eliminate DNA contamination was 
determined using RNA from a transgenic mammary gland. When the reverse 
transcriptase was omitted from the reaction, no amplification was observed (Fig. 
6A, last lane). 

Temporal expression studies were performed on the Fl transgenic mice in 
the breeding lines established from founder mice a LAF-15 and -18. Total RNAs 
were extracted at different physiological states in the transgenic mammary gland 
using tissue biopsies subjected to RT-PCR detection. The results showed that the 
recombinant hFVIII mRNA was expressed in all lactating periods (Dl, D8, D15 
and D22, shown in Fig. 6B), whereas the same transcripts were not detectable in 
the later stages of pregnancy before parturition (D-3) and in the weaning stage 
after lactation (D29 and D36, in Fig. 6B). The 2.0-kb bovine a LA promoter had 
a highly tissue-specific and stage-specific gene regulation activity in the 
mammary epithelial cells during the lactation period. 

To quantify the level of hFVIII mRNA in the mammary glands of 
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transgenic mice, densitometric analysis was performed, comparing the intensity of 
the 250-bp band in each lane normalized to the level of signal generated upon co- 
amplification of the mouse glyceraldehyde-3-phosphate dehydrogenase (GAPDH) 
primers. The intensity of the hFVIII transcript increased eight-fold and ten-fold 
5 upon Day 8 and Day 15 post-parturition, respectively, while a two-fold decrease 
was present on Day 22 of the later lactation (Fig. 6B), compared to Day 1 of the 
newly lactating stage. 

EXAMPLE 4 

10 Expression and secretion of recombinant hFVIII proteins into the 
transgenic milks 

To determine the production of recombinant hFVIII and to compare the 
secretion efficiency of rFVIII among the different transgenic mouse lines, milk 
was collected from five lines of FO, Fl and F2 females during lactation. Proteins 

15 from both non-transgenic control and transgenic skim milks were separated in 
SDS-PAGE and followed by Western blot analysis (Fig. 7A-7B). No 
polypeptides characteristic of recombinant hFVIII were detected in control mouse 
milk using the hFVIII-specific polyclonal antibody. In transgenic milk, rFVIII 
was detected as a heterogenous group of polypeptides of approximately 80-200 

20 kDa, of which the 92-200 kDa polypeptides represent the heavy chain and its 
proteolytic cleavage products or different glycosylational modification patterns. 
The small molecule of an 80 kDa corresponding to the light chain containing 
hFVIII C2 domain was also detected by this antibody (Fig. 7A). In similar 
experiments using monoclonal antibodies that recognize specific hFVIII light 

25 chain C2 domain structures, a specific polypeptide corresponding in size to the 80 
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kDa was observed as compared with the HPLC-purified hFVIII C2 domain 
peptide (Fig. 7B). 

In the transgenic goat (Tg-201), milk was collected from days 3, 1 1 and 22 
during early lactation stage for Western blot assay (Fig. 8). No polypeptides 
5 characteristic of recombinant hFVIII were detected in control goat milk using the 
hFVIII-specific polyclonal antibody. In transgenic milk, rFVIII was detected as a 
heterogenous group of polypeptides of approximately 80-200 kDa, which 
represent the heavy chain and its proteolytic cleavage products or different 
glycosylational modification patterns. 

10 

EXAMPLES 

Biological function assay for clotting activity of recombinant hFVIII derived 
from transgenic milks 

The coagulant activity of hFVIII was measured by reduction of the 
15 activated partial thromboplastin time (APTT) in a one-stage assay (Over, 1984, 
Scand J. Haematol. 41: 13-24.). Human FVIII dilutions equivalent to a 1:10 to 
1:120 dilution of normal plasma were prepared in 50 mM imidazole buffer, pH 
7.4, containing lOOmM sodium choride, 0.1% BSA and 0.01% Tween-20. Control 
and transgenic mouse milks diluted 1:20, 1:30, and 1:40 were incubated in 
20 duplicate at 37 °C in hFVIII-deficient plasma (Sigma Diagnostics, St. Louis, 
MO), followed by the automated addition of APTT reagent (Dade Behring, 
Marburg, Germany). Clot formation was initiated by the addition of 35 mM 
CaCl 2 . The clot time, in seconds, was recorded on an ST2 Coagulometer (Stago). 
Semi-logarithmic analysis was used to plot the clot time versus amount of FVIII. 
25 Control mouse milk gave results similar to the reagent blank. 



The concentrations of rFVIII ranged from 7.0 to 50.2 //g/ml, over 35- to 
200-fold higher than that in normal human plasma (Table 1). The expression 
levels of rFVIII from three groups on different line carrying low, middle and high 
copy numbers of transgenes were not significantly different. Recombinant hFVIII 

5 production generally tended to increase through the lactation period, but dropped 
dramatically at the end of lactation. This result was in parallel with the RNA 
transcript assay (Fig. 6A-6B) as well as the typical mouse milk production curve, 
mouse milk yield increased throughout lactation, peaked approximately D15 of 
lactation, and then decreased dramatically after D20. 

10 The functional activity of rFVIII was examined using one-stage clotting 

assays (Table 1) and this approach demonstrated that rFVIII secreted in the milk 
of transgenic mice was biologically active. In the clotting assay, the addition of 
transgenic mouse milk, but not control milk, to FVIII-deficient human plasma 
resulted in the restoration of normal coagulant activity. Up to 13.41 U/ml of 

15 rFVIII procoagulant activity was detected. The activity of rFVIII was detected 
throughout lactation as shown in Table 1. The same result for rFVIII clotting 
activity derived from transgenic goat also has been demonstrated and showed in 
Fig. 9A-9C. 
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Table 1. Recombinant hFVIII concentration and coagulation activity in the milk of 
transgenic animals with different groups of trans gene copy, lines and generations 



No. of trans genie 
rFVIII concentration 
mice 



Lactations and 



days (L/D) 



Coagulation activity 
assay (U/ml) a (ug/ml) 



Low copy ground- 5 copies) 

aLAF15 (Fo) L1/D7 
(n=l) L1/D12 

L1/D19 

aLAF36 (Fl) L1/D7 
(n=4) L1/D17 

L2/D7 
L2/D17 

Middle copy ground 0-20 copies) 
aLAF18 (Fl) L1/D7 
(n=3) L1/D12 

L1/D17 

aLAF28 (F2) L1/D7 
(n=3) L1/D12 

L2/D17 

High copy group(40-50copies) 
aLAF25 (Fl) L1/D8 
(n=3) L1/D15 

L1/D21 

aLAF25 (F2) L1/D8 
(n=3) L1/D15 

L1/D21 



4.82 ± 0.43 
7.11 ±0.76 
3.60 ±0.51 
6.38 ± 1.85 
5.64 ± 2.66 
13.41 ±2.87 
9.46 ± 2.04 

0.98 ± 0.39 
6.14 ±0.98 

8.91 ±2.15 
3.01 ± 0.95 

6.55 ± 2.07 
7.81 ±2.53 

11.28 ±2.33 
8.72 ± 1.79 

1.56 ±0.24 

7.92 ±2.01 
8.58 ±2.78 
1.22 ±0.61 



22.84 ± 1.65 
30.58 ± 1.37 
19.87 ±0.96 
29.26 ±3.58 
24.92 ± 3.04 
50.21 ± 5.73 
40.84 ± 4.91 

7.03 ± 2.56 

32.07 ±4.30 
41.37 ±7.78 

9.23 ± 3.74 
35.56 ±6.08 
38.11 ±5.72 

46.33 ± 4.57 
32.92 ± 3.97 

15.08 ±4.84 
39.29 ±6.89 
43.48 ±5.21 
13.32 ±5.05 



a One unit of rFVTII was denned as equivalent to the amount of human FVin normally present in 1ml 
of plasma, approximately 200 ng. Results presented are the average of two independent assays 
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THE ADVANTAGES OF THE INVENTION 

1. The new developed mammary-specific expression cassette containing three 
important parts: . 

(1) 2.0-kb bovine a LA gene promoter which has been shown to be the most 
5 lactation-specific of all bovine milk protein genes; 

(2) use of the a Sl-CN 15-aa or a LA 19-aa as a secretary peptide to lead 
the rFVHI protein secretion; and 

(3) the bGH polyadenosylation signal to stabilize the steady-state of hFVIII 
RNA molecules, which can guaranty the highly expression and efficiently 

10 secretion of any exogenic proteins. 

2. Transgenic expression delivers the advantages of mammalian cells such as a 
sophisticated refolding machinery and glycosylation. 

3. Transgenic productions are scale-up flexibility since the herd size can be 
increased rapidly and inexpensively, as well as the relative low-cost and low 

15 complexity of the raw product manufacturing facilities as compared to more 

traditional cell culture facilities. 

4. Biological active recombinant hFVIII protein derived from transgenic milks is 
easily collected by diary automatic milk collection system and simply purified 
procedure to obtain the massive recombinant proteins. 

20 5. Transgenic mice, goats and pigs expressing high levels of rFVHI either in full- 
length gene constructs or in B domain-deleted hFVIII will be a great value 
system for the mass production of biologically active rFVHI in transgenic 
farm animals. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(iii) NUMBER OF SEQUENCES: 15 

5 (2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: artificial synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATGATGTCCT TTGTCTCTCT GCTCCTGGTA GGCATCCTAT TCCATGCCAC 
CCAGGCTGTT AAC 63 

15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 nucleotides 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: artificial synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

28 



ATGAAACTTC TCATCCITAC CTGTCTTGTG GCTGTTGCTG CCAGGTTAAC 50 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 22 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

1 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 : 

GGTTAACTGC CACCAGAAGA TA 22 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
1 5 (A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AAGCTTCTTG GTTCAATGGC 20 

(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AAGCTTGAAA CGCCATCAAC GGGAA 25 

10 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTCGAGCCTC AGTAGAGGTC CTGT 24 

20 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTCTCTTGTC ATCCTCTTCC 20 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 nucleotides 

(E) TYPE: nucleic acid 

(F) STRANDEDNESS: single 

(G) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGTTACGCGT CAAGATTCTG A 21 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGACTTTCGG AACAGAGGCA 20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATCTTTTTCC AGGTCAACAT CA 22 

(2) INFORMATION FOR SEQ ID NO: 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CATTCTATTC ATTTCAGTGG ACA 23 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 nucleotides 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAGATGTAGA GGCTGGAGAA CT 22 

10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: Amino acid 

1 5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLCULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

20 Met Met Ser Phe Val Ser Leu Leu Leu Val Gly He Leu Phe His 
15 10 15 

Ala Thr Glu Ala 
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(3) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(E) LENGTH: 15 amino acids 

(F) TYPE: Amino acid 

5 (G)STRANDEDNESS: single 

(H) TOPOLOGY: linear 

(ii) MOLCULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Met Lys Leu Leu He Leu Thr Cys Leu Val Ala Val Ala Ala Arg 
10 1 5 10 15 

(3 )INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(I) LENGTH: 1448 amino acids 
15 (J) TYPE: Amino acid 

(K)STRANDEDNESS: single 
(L) TOPOLOGY: linear 

(ii) MOLCULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

20 Met Lys Leu Leu He Leu Thr Cys Leu Val Ala Val Ala Ala Arg 
15 10 15 

Leu Thr Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

34 



Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala 

35 40 45 

Arg Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser 

50 55 60 

5 Val Val Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu 

65 70 75 

Phe Asn He Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly 

80 85 90 

Pro Thr He Gin Ala Glu Val Tyr Asp Thr Val Val He Thr Leu 
10 95 100 105 

Lys Asn Met Ala Ser His Pro Val Ser Leu His Ala Val Gly Val 

110 115 120 

Ser Tyr Trp Lys Ala Ser Glu Gly Ala Glu Tyr Asp Asp Gin Thr 

125 130 135 

15 Ser Gin Arg Glu Lys Glu Asp Asp Lys Val Phe Pro Gly Gly Ser 

140 145 150 

His Thr Tyr Val Trp Gin Val Leu Lys Glu Asn Gly Pro Met Ala 

155 160 165 

Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser His Val Asp 
20 170 175 180 

Leu Val Lys Asp Leu Asn Ser Gly Leu He Gly Ala Leu Leu Val 

185 190 195 
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Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gin Thr Leu His 

200 ' 205 210 

Lys Phe He Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp 

215 220 225 

5 His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp Ala Ala 

230 235 240 

Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val 

245 250 255 

Asn Arg Ser Leu Pro Gly Leu He Gly Cys His Arg Lys Ser Val 

10 260 265 270 

Tyr Trp His Val He Gly Met Gly Thr Thr Pro Glu Val His Ser 

275 280 285 

He Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gin 

290 295 300 

15 Ala Ser Leu Glu He Ser Pro He Thr Phe Leu Thr Ala Gin Thr 

305 310 315 

Leu Leu Met Asp Leu Gly Gin Phe Leu Leu Phe Cys His He Ser 

320 325 330 

Ser His Gin His Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser 

20 335 340 345 

Cys Pro Glu Glu Pro Gin Leu Arg Met Lys Asn Asn Glu Glu Ala 

350 355 360 
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Glu Asp Tyr Asp Asp Asp 
365 

Arg Phe Asp Asp Asp Asn 
380 

5 Val Ala Lys Lys His Pro 

395 

Glu Glu Glu Asp Trp Asp 
410 

Asp Arg Ser Tyr Lys Ser 
10 425 

He Gly Arg Lys Tyr Lys 
440 

Glu Thr Phe Lys Thr Arg 
455 

15 Leu Gly Pro Leu Leu Tyr 

470 

He Phe Lys Asn Gin Ala 
485 

Gly He Thr Asp Val Arg 
20 500 

Gly Val Lys His Leu Lys 
515 



Leu Thr Asp Ser Glu Met Asp Val Val 

370 375 

Ser Pro Ser Phe He Gin He Arg Ser 

385 390 

Lys Thr Trp Val His Tyr He Ala Ala 

400 405 

Tyr Ala Pro Leu Val Leu Ala Pro Asp 

415 420 

Gin Tyr Leu Asn Asn Gly Pro Gin Arg 

430 435 

Lys Val Arg Phe Met Ala Tyr Thr Asp 

445 450 

Glu Ala He Gin His Glu Ser Gly He 

460 465 

Gly Glu Val Gly Asp Thr Leu Leu He 

475 480 

Ser Arg Pro Tyr Asn He Tyr Pro His 

490 495 

Pro Leu Tyr Ser Arg Arg Leu Pro Lys 

505 510 

Asp Phe Pro He Leu Pro Gly Glu He 

520 • 525 
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Phe Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys 

530 535 540 

Ser Asp Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn 

545 550 555 

5 Met Glu Arg Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu lie 

560 565 570 

Cys Tyr Lys Glu Ser Val Asp Gin Arg Gly Asn Gin lie Met Ser 

575 580 585 

Asp Lys Arg Asn Val He Leu Phe Ser Val Phe Asp Glu Asn Arg 

10 590 595 600 

Ser Trp Tyr Leu Thr Glu Asn He Gin Arg Phe Leu Pro Asn Pro 

605 610 615 

Ala Gly Val Gin Leu Glu Asp Pro Glu Phe Gin Ala Ser Asn He 

620 625 630 

15 Met His Ser He Asn Gly Tyr Val Phe Asp Ser Leu Gin Leu Ser 

635 640 645 

Val Cys Leu His Glu Val Ala Tyr Trp Tyr He Leu Ser He Gly 

650 655 660 

Ala Gin Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr Thr Phe 

20 665 670 675 

Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro Phe 

680 685 690 
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Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 

695 700 705 

He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr 

710 715 720 

5 Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr 

725 730 735 

Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 

740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Leu Lys Arg His Gin Arg Glu 

10 755 760 765 

He Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu He Asp Tyr 

770 775 780 

Asp Asp Thr He Ser Val Glu Met Lys Lys Glu Asp Phe Asp He 

785 790 795 

15 Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys Lys 

800 805 810 

Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp Asp Tyr 

815 820 825 

Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser 

20 830 835 840 

Gly Ser . Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr 

845 850 855 
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Asp Gly Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu 

860 865 870 

His Leu Gly Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp 

875 880 885 

5 Asn He Met Val Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser 

890 895 900 

Phe Tyr Ser Ser Leu lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly 

905 910 915 

Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr 

10 920 925 930 

Tyr Phe Trp Lys Val Gin His His Met Ala Pro Thr Lys Asp Glu 

935 940 945 

Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu 

950 955 960 

15 Lys Asp Val His Ser Gly Leu lie Gly Pro Leu Leu Val Cys His 

965 970 975 

Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gin Val Thr Val Gin 

980 985 990 

Glu Phe Ala Leu Phe Phe Thr He Phe Asp Glu Thr Lys Ser Trp 

20 995 1000 1005 

Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn 

1010 1015 1020 
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He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His 

1025 1030 1035 

Ala He Asn Gly Tyr He Met Asp Thr Leu Pro Gly Leu Val Met 

1040 1045 1050 

5 Ala Gin Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met Gly Ser 

1055 1060 1065 

Asn Glu Asn He His Ser He His Phe Ser Gly His Val Phe Thr 

1070 1075 1080 

Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 

10 1085 1090 1095 

Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly 

1100 1105 1110 

He Trp Arg Val Glu Cys Leu He Gly Glu His Leu His Ala Gly 

1115 1120 1125 

15 Met Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro 

1130 1135 1140 

Leu Gly Met Ala Ser Gly His He Arg Asp Phe Gin He Thr Ala 

1145 1150 1155 

Ser Gly Gin Tyr Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His 

20 1160 1165 1170 

Tyr Ser Gly Ser lie Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser 

1175 1180 1185 
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Trp He Lys Val Asp Leu Leu Ala Pro Met He He His Gly lie 

1190 1195 1200 

Lys Thr Gin Gly Ala Arg Gin Lys Phe Ser Ser Leu Tyr lie Ser 

1205 1210 1215 

5 Gin Phe He lie Met Tyr Ser Leu Asp Gly Lys Lys Trp Gin Thr 

1220 1225 1230 

Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn 

1235 1240 1245 

Val Asp Ser Ser Gly He Lys His Asn He Phe Asn Pro Pro He 

10 1250 1255 1260 

He Ala Arg Tyr He Arg Leu His Pro Thr His Tyr Ser He Arg 

1265 1270 1275 

Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys 

1280 1285 1290 

15 Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp Ala Gin 

1295 1300 1305 

He Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser 

1310 1315 1320 

Pro Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 

20 1325 1330 1335 

Arg Pro Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe 

1340 1345 1350 
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Gin Lys Thr Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys 

1355 1360 1365 

Ser Leu Leu Thr Ser Met Tyr Val Lys Glu Phe Leu He Ser Ser 

1370 1375 1380 

Ser Gin Asp Gly His Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys 

1385 1390 1395 

Val Lys Val Phe Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val 

1400 1405 1410 

Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg lie His 

1415 1420 1425 

Pro Gin Ser Trp Val His Gin He Ala Leu Arg Met Glu Val Leu 

1430 1435 1440 

Gly Cys Glu Ala Gin Asp Leu Tyr 

1445 
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